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HUMAN nucleic acid fragments, isolated from brain 

ADRENAL TISSUE, PLACENTA CR BONE NARROW 

Th: r . invention i c* 1 a t e ■: r o r. e v.- r, u c 1 e i c 
predicts cr pert ; : r, s thereof, which frag 
rjcl^c acid p^pj 1 2t :cr.: , ir.divid-jjl me- 
present in widely varying a.m-r.ts. 



rra;::ent5 encoding gene 
n t s are obtainable f r c : n h u m a n 
r .T5 c f such ccnuiaticns ceir: 



10 



15 



25 



Situations are increasingly arising m whicn it . s necessary to study 
complex nucleic acid cr po 1 y nu c 1 ec t ice pepu 1 a t ions . For example, it is 
new widely appreciated that an invaluable resource could be created if 
the r nr. ire sequence of the genomes cf organisms such as man were 
determined and the information, available. The magnitude of such a task 
should not, however, be underestimated. Thus, the human genome may 
contain as many as 1~0,CC'Q genes [a very substantial proportion cf 
which xay be expressed m the h-m.an brain (Sutcliffe, Ann. Rev. 
Neu r c o c i . 11:15 7 ( 1 9 5 r : i ) j - Only a very small percentage c f the stork 



cf hum, an genes has p resent] 
o i e c em e a 1 end u s u a 1 1 v o e c i f i 



o e e n e x p i o r e u , a 
ill-/ t a r c e t e d f a z r 



There has been much p.b' i: cerate a::r; t r . e nest r e a n s of a c r r c a c h i n u 
human genor.c sequencing. Brenner has arcjed (CI 3a Foundation Svrrocsiim. 
149: £ (1990)) that effcrts socle re concentrated on cLVNAs produced 
from, reverse transcribed mPNAs rather than on genom.io Dr.' A. This is 
primarily because rest useful genetic information resides i n the 
fraction of the aenome wnich cor re.- ponds to nRNA, and this fraction is 
a very small part c f the total ( 5 \ or less}. Moreover, techniques for 
generating c D N A s are also well kr.c w n . Got h e ether hand, e v e n 
supposing near per fee* recovery cf cTNAs corresponding to all expressed 
r.RNA s , some potentially useful in: or 
approach, including sequences resoona; 
genes. Nonetheless, the o 



a ^ ^ . ... -jl . 



sequences or n c n - c ~ d i 
;e.oc~ic DNA s eou e h c i r. - 



cation will re i o s t o y 
ole for control and regulation cf 
h at least ouostantially redo cos 
i.og fro--. analysis of repeated 




nucle;c acid <■ •:. ;>.».* '. e ; one o!hA : : i _ r tr.e croteic. secuences 
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other and to *:rc~n cequences in public databases, that each cf the 
cDNAs picked a- random, could be unambiguously classified. The cDNAs 
could be classified as being either entirely new or as corresponding, 
to a greater cr leooer extent, to a previously known sequence. cDKAs 
identified in this way were further characterised and found to be 
useful in a variety of standard applications, including physical 
mapping. Unfortunately, such a process is insufficient. The longer 
the process is pursued with any given population of cDNAs the less 
efficient it becomes and the lower the rate of identification of new 
clones. In essence, as the number of cDNAs which have already been 
picked rises, the probability of picking a particular cDNA more than 
once increases. This difficulty is exacerbated by the wide range of 
abundancies at which different cDHAs can occur, which abundancies can 
vary by several orders of magritud3. Thus, whereas some sequences are 
exceedingly rare, a single cDKA type may comprise as much as 10% of the 
population of cDNAs produced from a particular tissue (Lewin, B. Gene 
Expression, Vol. 2: Eukaryotic Chrcmosoxes, 2nd ed., pp. 703-719. New 
*ork: Wiley, I960). The need to avoid missing rarer species in ar.y 
given population presents a considerable problem. 



Various approaches ^no-called "normalisation 
tried in addressing the problem of increasing the 
examination of a mixed nucleotide population, for example, such 
population as is to be examined in human genome sequencing. 



techniques) have been 
ng the efficiency of 

a 



Thus, a standard PCR protocol can be used to amplify selectively cDNAs 
which are present at extremely low levels, if there is information 
abc t the sequence cf those cDNAs. If net, a pnrr.er specific to the 
de- red cDNA cannot be constructed and tne desired cDNA cannot be 
selectively amplified. The standard PCR method is therefore inadequate 
if it is desired to characterise a number of unknown genes. 



35 



40 



A second approach involves hybridization of cDNA to genomic CNA. At 
saturation, ^he cDNAs recovered from genomic / cDNA hybrids will be 
present in the same abundance as the genes encoding them. This will 
provide a much more homogenous population than the original cDN'A 
library, but does net entirely solve the problem. In order to reach 
saturation m respect cf the very rare sequences, it will ce necessary 
to use huge quantities of cDNA, which need to be allowed to anneal to 
large amounts of genomic CNA over a considerable r,pr-,^ n.* t- 
furthermore, cIjNA'j which are homologous to 



per iocs c i n.-.e. 
e s which are present in 



! 
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A third approach exploits the second order reasGcciaticn kinetics cf 
cDNA annealing to itself. After a lcng period of annealing, the cDNAs 
which regain single stranded will have nearly the same abundance, and 
can be recovered by standard PCR (see Patanjali, S.R. et al., PKAS USA 
88/ 1991, pp 1943-1947 ; Ko, M.S.H., NAR 19, no. 18, 1991, pp 5705- 
5711). The methods disclosed in these two publications, however, 
suffer from notable disadvantages. They are entirely dependent on the 
stringent physical separation of single stranded and double stranded 
DMA , require an elevated number of manual manipulations in each 
reaction, and necessitate protracted hybridisation times (up to 288 
hours in the method of Patanjali et al.) 



Yet a further approach in "normalising" a nucleotide population is 
described in co-pending Eritish Prtent Application No 91 15407.0, filed 
lb 17th July, 1991 by KRC , and involves a PCR process in which a mixture 
comprising a heterogenous DNA population and appropriate 
oligonucleotide prir.'.ers is first formed and the DNA denatured, but 
before effecting a conventional PCR protocol the conditions are altered 
to allow the denatured strands of the more common DN'A species to 
reanneal together, whilst avoiding annealing of primers to the DNA 
strands. By this means, rarer species can subsequently be amplified m 
preference to the more common species. 



This PCR normalisation method in general comprises the steps of: 

(a) preparing a mixture comprising a heterogenous DNA population and 
oligonucleotide primers suitable for use in a PCR process, in which the 
DNA is denatured; 

30 < b ) altering the conditions to allow the denatured strands of the 
more common DN'A species to reanneal, while preventing the annealing to 
■he primers to the DNA strands ; 



(c) further altering the conditions of the mixture in order to alio, 
the primers to anneal to the remaining single-stranded DN'A ccmrrioin: 
the rarer DNA species; and 

(d) carrying out an extension synthesis in the mixture ^reduced ir 
step ( c ) . 

Advantageously, the rethori consists of a rycl io aro 1 i c 1 1 on - i th>- 
above four s t e c s . 



WO 94/01548 



PCT/GB93/01467 



10 



It will be apprc;. ated that the conditions may be altered by th. 
alteration of the temperature of the reaction fixture . However , any 
conditions which ar'fect the hybridisation of complementary DNA strands 
to cne another ray be varied to achieve the required result. 

3ecause the reannealing efficiency of any given DNA species will depend 
on the product cf its concentration and time, the more abundant the 
sequence tne greater the extent to which it will reanneal in any given 
tine period. Once a DNA species has reached a certain threshold 
concentration it will no longer be amplified exponentially, as a 
significant amount will have annealed to the double stranded form 
before the priming step. Thus, as each individual DNA species is 
amplified by the process to its threshold concentration, the r?te of 
amplif ication of that species w:ll start to tail off. Ever.'.i.uUy, 
lb therefore, all DNA species will be present at the same concentration. 

The length of t : i&annealing step will determine how rr.uch DNA is 

present at the threshold concentration. Preferably, therefore, the 

duration of the reannealing £tep will be determined empirically for 
each DNA population. 



20 



In the PCR normalisation process in general, the DNA primers may be 
adapted to prime selectively a sample of the total DNA population. 3y 
using primers which will only prime a sample of the population, only 

25 that sample will be amplified ana normalised. The total quantity of 
DNA generated will thereby be reduced, which means that the cycling 
times can be kept low. This ensures that the method is applicable to 
complex DNA populations such as cDN A populations. In addition, a first 
primer can oe used which is adapted selectively to prime a sample cf 

3C the total cDNA popu 1 at if , and a second primer which is a general 
primer. Advantageously, the general primer is cligo dT (each primed 
rDNA will then be replicated in its entirely, as the olioo dT primer 
will anneal to the poly-A tail at the end of the cDNA). 

3 " In "^-pending British Patent Application No 92 14875.3, filed cv K?C 
13th July.. 1992, a new process i? described which allow 



e b:u- v 



identification ct the individual remoers ^f a mixed cr heterogenous 
population cf nucleotide sequences perhaps of varying abundance. I :\ 
preferred embodiments of the said procers, the starting nucleic acid 
0 population is t r ° ?. ted b v : 
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preferably an er.dcr.u c 1 ease whic:. has its cleavage and recognizer, sites 
separated, which reagent cleaves the nucleic acid so as to produce 
double stranded cleavage products the individual strands of which 
overlap at cleaved ends to leave a single strand extending to a known 
extent ; 



10 



(b) ligating the cleavage products from (a) with a population of 
adaptor molecules to generate adaptored cleavage products, each of 
which adaptor molecules has a cleavage product end recognition sequence 
and the population thereof encompassing a range of adaptor molecules 
having recognition sequences complementary to a predetermined subset of 
the sequences of the cleavage-generated extending single strands; and 



15 



(c) selecting and separ ting only those adaptored cleavage products 
resulting from (b) which carry an adaptor of predetermined recognition 
sequence . 



20 



25 



A preferred endonuc lease for use m step (a) of the above process is 
Fok 1. 

An important feature of th.s process is the use of adapter r.olecules. 
Tha adaptors used must have "overhanging" fragment recognition 
sequences which reflect cr are complementary to the extending cleavage- 
derived sequences which the adaptors are designed to react with. It is 
also preferred that the adaptors used should end with a 5' hydroxyl 
group. The avoidance of a 5' phosphate group removes tne risk of 
inappropriate ligation involving the adaptors. 



30 



Adaptor molecules 



contain 



portion permitting 



spec::: 



can 



sequence selection and separation (as in step (c) of the process) when 
a sequence is attached to the adaptor. For example, an adaptor 
.arry biotin, thereby permitting advantage to be taken cf 
biot in/ a v id i n 
reaction in select: 
Additionally, adapt 
sequence such that 
amplified by knc«: 
ocmolementarv to the 



ser 



a r a 1 1 n t 



ors preferably cc 
s p e z ■_ f . . c a 1 1 y i s c 1 a ! 
techniques (sue 
core sequence. 



' . ; d . . . ^ 0 

'■olccule: 



Preferably the ad a: 
which can re joined 



: c r s are 



".crt 



3 en c h r r: i c a : 
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and so that large ccr.cer.-r at car: ce easily proceed. They rr.ay also 

ne cherr.:cali- - ~cdi:_ed in a way which allows the- to be easily purified 
during the process. As r e n 1 1 c n e d dc:vc, ideally t h 3 1 r f ' e p a 3 will be 
unphosphoryiated so thot c.nce joined to fragments the adanocred end cf 
the latter will no lender be able to participate in. further ligation 
reactions. 



15 



It is preferred that the adaptor cleavage product end recognition 
sequences are on the 5' end cf the longest oligonucleotide strand 
10 r.aking up the preferred adaptor rolecules, are at least 3 nucleotides 
in length and with totally random bases at the single-stranded 
position(s) two nucleotides in from the 5* end. This then allows 
selection to be performed both during the joining reaction and during 
subsequent priming reaction.,. Then, because the final degree of 
selection is a result of the product of the degrees cf selection 
achieved at these two stages, raxirr.u~ selection can be achieved per 
ad aptor/pr irer available. 

Adaptor strand extensions cn the 5' eno cf the longest oligonucleotide 
20 also facilitate the use of codified oligonucleotides for separation 
purpose- Preferably, the snort oligonucleotide will be -edified at 
its 5' end. This has the double benefit cf requiring lust one r.odified 
oligonucleotide for all possible s mg le- stranded extensions that are 
used, and also placing the modification at a position where it cannot 
interfere with ligation or subsequent priming reactions. 

Although only ere type of adaptor is required per ligation reaction, it 
is preferred tnat adaptors cover .ng all possible reactions in a chosen 
subset o: sequences be present, because then the opportunity for 
fragments in the chosen subset to ligate to each other is minimised. 
It is also preferred that the chosen specific adaptor, carrying a 
predetermined recognition sequence, should not only be different fro- 
the other adapters in its s 1 n g 1 e - s t r a n c e d extension, out also different 
~n the rest ct its sequence 31 nee this allows orientation to oe 
introduced : ? useful 1 r. s u h r. e o - <.:• : . t '-ten o . It 1 5 t n e r e fere a " ~ - 

preferred that tms adaptor has => modified o 1 monuc 1 eot ice to 
facilitate its reparation with t:.e cle---.vino crozuots -0 which it -cms. 

The aoove " acap t c r 1 n g " process ran c e used to generate categories or 
subsets of sequences by :a^o; :: toe adaptors specific m sone 
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chosen, e.g. 
facilitates, 
sequences b ^ 



? m subsequent nucleotide sequencing. This 

jla, the identification cf a large population of 

rational approach to splitting such 

populations ir.tc subsets, each of which subsets can be examined in 
turn. 



: c r exa.T 

rermittmg 
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In the light of these developments, the present invention now provides 
a nucleic acid fragment encoding a gene product or portion thereof and 
con. prising any cne of:- 



a sequence selected from SEQ ID Nos 1 to 1193; 



(b) an allelic variation of a sequence as defined in (a); or 

x ^ ( c ) a sequence complementary to (a) or (b). 

In another aspect, the invention provides a nucleic acvd sequence as 
set out in any one cf SEQ ID Nos 1 to 1193, or a complement or allelic 
variation thereof. Preferred sequences exhibit no more than 90t 
20 homology to a human sequence known per se. 

In a further aspect, the invention provides a nucleic acid fragment 
comprising a portion of a sequence as defined above of sufficient size 
such that a prcbe cf the same size and exhibiting complementarity to 
25 said portion can hybridise to said sequence. Preferably, such portions 
are at least IS bases in length. It will be appreciated that minor 
mismatches in .he aforesaid "complementarity" are not excluded provided 
hybridisation can still occur. In general, hybridisation conditions 
are within the choice of the skilled person, but reference can be made, 
30 for example, to the following; Melting temperature cf hybrids - 
Bolton, E. T. and McCarthy, B . J. Proc. Natl. Acad. Sci, 48 pl390 
(1962). Effect cf formamide on lowering melting temperature - Casey, 
J. and Davidson, N - , Sucleic Acids Res. 4. M^77 
imperfect hcmolcrr. 
(19 7 3). General - 
(1934 ) . Oiigo r.yz: 
PI ( 1935 ) . 



35 



- donner, 
y e i r. k o t h , J 
idinaticn a r 



Res. 4, pi 539 (' 
I . e t a i . , J . M f 1 
a n d W a h 1 , G . Anal. 5 
washing - Lathe, R. . 



Effect oi 
51 , vl2: 



40 



The present 
fragments o 
regulator y e 



.rH v e £' L c r i_ - a _L _ 5 °. ... S n >; i s a u_e s DN. 
se ^-?nces as referred to a; 



constructs 
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The :nve-:cn include. _ch DNA constructs using a gene S) ster, known in 
tn. art ligated to a -^ence cr fragment of the invention so as to 
enable, upon expression, the provision of a fusion polypeptide. 
Preferably, an endopept idase recognition site is provided such that 
when the sequence cr fragment ;s expressed it is expressed m frame 
with a known protein with the boundary be.r.g a cleavage site for an 
endor >ptidase with a rare cutting site. The known protein can then bo 
affinity purified, and the peptide corresponding to tae fragment or 
sequence m accordance with the invention may be released by tne 
endcpeptidase. Alternatively, the whole protein can be used to r^ise 
antibodies which can then be screened for those directed at polypeptide 
corresponding to the fragment or sequence of the inven.ion. 

Since the present fragments and sequences can be used to produTe, inter 
15 alia, corresponding genes, whether by isolating them, by synthesis or 
otherwise, such use and the resulting DNA fragments comprising genes 
are further aepectr of the invention. 

'ret another aspect cf the invent ^n is an expression vector comprising 
20 a fragment, sequence, gene-compr i s i ng DNA fragment, cr DNA construct, 
as above, positioned ^ch that that nucleic acid sequence which 
encodes the polypeptide corresponding to said fragment, sequence or DNA 
fragment is m operable reading frame with a control or regulatory 
sequence . 

25 

Other aspects cf the invention are host cells incorporating a sequence, 
or fragment, cr gene-comprising DNA fragment, or UN* A construct, as 
above, as a heterologous part of the expressible genetic information cf 
the cell. The production of such modified host ceils can be achieved 
20 using methods known m the art. Such modified host cells can be used 
to express corresponding proteins, and these materials lend themselves 
in turn to the preparation of corresponding monoclcnal or polyclonal 
antibodies using standard techniques. 

is A:^ included m this invention are such antibodies. Poferer.ee can ho 
made, inter alia, to the following literature; Monoclonal antifcodies, 
Carr.bell, A. M. Moratory Techniques m 3 i o cnem i s t r v , and Molecuia- 
Biology Ed. Burden, ?. H . and van Kmpper.berg , ?. H . vol 13. Elsevier 
Amsterdam 195s. Coding, J. w., Monoclonal antibodies: Principles and 

40 Practice, 2nd Edition, academic Press, London 1366. Kipps, ?* J. and 
Herzenberg, L . A . , P^r.dbock cf -.x ter im-n fc a 1 ' ~~ ,j - - ■ o - ■ ' - a~ - ' < ~ a«- * — 
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D. M . et al., plC8 Bla.^well scientific Publications, Oxford. Ha r lew, 
E and Lane, 2. Antibodies, A Laboratory Manual, Cold spring harbor 



Laboratory, Cold Spring, New York. 



10 



15 



20 



25 



Expression in an appropriate higher eucaryotic nest may be important to 
ensure correct protein folding and also activity. Expression to avoid 
copurif icat ion of toxic products can sonetines be better performed in 
organisms approved for human consumption, eg prokaryotic Bacillus 
subtilis, eurkaryctic yeast, mammalian cows milk vectors, and other 
methods known in the art. 

The invention also includes novel gene products or portions thereof 
encoded by a fragment, sequence or gene-comprising LHJA fragment of the 
i nvent ion . 

It will be appreciated that the sequences of the present invention 
collectively have ••tility based, inter alia, upon their common origin, 
and hence they can effectively be considered together rather than as 
separate entities. It is convenient to represent them as separate 
sequences, because this is how they were produced and serves as 
"punctuation" between the different functional entities which each 
sequence represents. However, the sequences could just as easily have 
been presented as a continuous sequence derived by placing then end to 
end in the order in which they were produced, with a separate 
indication of where the beginnings and ends of the component sequences 
are . 



30 
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In contrast to investigations hitherto, where gene fragments (sequence 
fragments) could enly be identified through some known characteristic 
[for example: their homology to a fragment i-M.ch largely encodes amino 
acids identified by sequencing a previously isolated peptide cr is the 
antisense of that coding sequence; or them having at least partial 
homology to previously characterised nucleic acids; cr them havir.a 
ability to encode expressed proteins which could later be detected tv 
functional assays of the cells expressing those prcteir.s cr bv usir.c 
antibodies which had been previously raiccd acamst the orctems to 
detect their expression, Sambrook J., et al., Molecular Cloning CSH 
Press 1989], the sequences and fragments described by the present 
invention are entirely underivable and unpredictable from the prior 
art, but are nonetheless clearly cf great value for various cur coses. 



:n s f u e n - 
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" a r?1 ^ fcr ^termming the existence of members c-' 

eXi " ln? ?en ° :3r ' i:ier ' rew h ^ an 3^' es previously only nen-hu««n 

ge.es were ; : n=wn. ,,d r.,w ,.„.. when previously ,o were Known 

(Karli.n, S. . ., Altschul, S. y. Prcc. ,, 3ti . Acad . Sci . £7 c226 -_ 2263 

U980,) - :n ali C3SeS ' this all ^ s ^ s -solution of the corresponding 
genes and the:r products, and ne.nce enables the manufacture C 
r.olecuies of potential biological interest by recombinant rr^ears 
Screening libraries of known materials or hitherto unexplored source 
materials for biological efficacy is now an important industrial 
10 activity in the search for new therapies and therapeutics. Knen new 
sequences have already been found to have counterparts in gene families 
or in non-human genes then knowledge about biological efficacy nay 
already be appcrant. For example, new rectors or receptor 

agonists/antagonists may exhibit differences t. known instances o< 
ib theoe molecules, and such differences could Mke the., ,.ore suitable as 
therapeutics by, for example, exhibiting binding characteristics vhi- 
are ,= re in Ke.-.r.g w::„ avoidr.nce of toxicity. Reference can re „ de 
for example, to polymorphic dopamine receptors and the implications i~l 
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20 M. 



cental health C verse.-., Nature 353, P 109 ( 1992,, and van To 1 , H T 
NatUre 358 ' P 149 " 1 " U«2)). Where absolutely required. 



realisation oi full lenoth cPNAs f or expression can h, bv 
'- e 5e ^ = ° 5 " screen , bv hybridisation c -,,.,k^ ^. .T 
libraries containing full ienqth, c]on P s (D'Alessicn, J. u e - a' 
Focus (Gibco 3.P..L, 9 ol ,1937)). Alternatively, the sequence can be 
used to design primers suitable for obtaining the rising sequences bv 
PCP or other amplification methods (frcn.man, M . A., Dush, M Y and 
Martin, G . P Prcc. Natl. Acad. Sri. SS =3996-9002 -1933,,. 

Appropriate use cf tne sequence fragments in antisense or trir-le r 
(Griffin ct al., Science 245 P 967-971 (1589,, applications w- 
useful for identifying manipulable targets related to disease 
example, viruses have been inhibited by antisense ?..'<'.-. to th^ - - = 
<Cnan 7 , ,: ; .--.It:,,, c. ,. Virol. F 9i:-97, < :937ll . .. 



be 
For 



v i r 'j 3 . 



f.-cc-r.tia; : : -re— or ~ -. : : - .... 



Partial cr :'u'~ 'o ■ .. - . „ 

'• " - - 3 " e - e C '^' ----- -v -co r-.n-Pc- ~,„ 



manner of expression can CO selected by c-.e skilled in - -.e a-- s --- 

tr,G lr;t9n = 0d -.licaticn. Kxprecsio.-, rf full length c ~ - l£ tvni-al'v 

r ^ ^ j i r c f - r ■ - * 



Ij) 







WO 94/01543 



PCT/GB93/0I46" 



10 



15 



which 



raised. 



t h e res-.:s c f t r. e ass a v 
cancer. For cxar.slc 
therapeutic a cents. Tr. 
monitor markers cf u nd i f f er e.nt i at ed 
or i mmu r. e d::ea"p or ^ r h 



n be correlated 
m l r k e r 3 rr a v b ^ 



reprices a o a i r. s c 
' beneficial wr.e r. 



-r.d and used to target 
antibodies can also be used to detect or 
rowtn, mf ect or. , cardiovascular 
a3e or a therapeutic respcr.se. When the antibodies 
recognise cell surface prctems they can be used in isolation or m 
combination to isolate particular populations cf cells. These in turn 
can be used to isolate yet m.ore cDNAs which will be enriched for yet 
more cf such surface markers for the population, which, if similarly 
11 permit yet further subdivision of the population, 
arels cf antibodies which can describe particular dise^ 
states will accrue. Such antibodies could be tailored 
applications 



screened , 
Ult imatel v 



.■ell as 



uaar.cstic 



and ri; 



scdrjc rtc.nitcr 



asp 
orens i c 
c r i n o . 



i e secuencc- 



25 



fragment-- can also be used for gene: 
.capping, for exarr.pl e # to diagnose 
individual is predisposed toward 
event nf a serjue. 



a n a * y s i s and 
the likelihood chat a given 
given genetic disease. In t he 
:e cc-iccating, genetically, with a disease ccne, it 
can be used for tne derivation of new disease therapies bases upon 
precise 

the techniques of so-called "gene thera: 
p455-460 ( 1992 ) ) . 



:cr ex a :r pie, 
:y filler, A. Nat- re 35 7 



Antibodies can ce produced against tne protein cf a genetic disease 
with sufficient discriminating power to discriminate oetween ciseased 
and ncn-oiseased states (Caskey, T. Genome Seouencmo Con: 
Hilton Head, S. Carolina (1991)). This would be useful 
the dependence of su; 
antibodies aire have 

expression cf : re prcteir., fcr example level: 
be i rr o c r t a r. t i r r d e v c 1 1 c r ~ r ' -~ *~ ~ 



-ests on nucleic arid-based 

advantage- c f 1 1 c w i r. o detection 

c x p r e s s i c - 
' > m slow enect z 



e r e n c e , 
fcr red u cm c 
r e e n s . Such 



Also very i m p r r t a n t is that n : 
conventional r e a n 3 , w r. e r e a s t h 
'comprehensive"'. Tne use cf t 
necessity to truncated clones i 
will be do-, : free c f a 



■ i 



ease: 
; o q u e r c e o t h 



sequences are, in c ~. e sense 
3 of c H N'A s which crrr e- : r ends o ; 



WO 94/01548 



PCT/GB93/0I467 



20 



13 

c lone . 

This invention will now be further described and illustrated by means 
of the following Examples. 

5 

All oligonucleotides used in these Examples were synthesised Trityl on 
using an A3I 3803 DNA Synthesizer accordin to the manufacturers 
instructions. Purification was by reverse phase HPLC (see, for 
example, Becker, C, R. , et al., J. Chromatography 326, p293-299 
10 (1985)). 

Example 1 

Human brain and adrenal tissues ware obtained from a mixture of 12 to 
15 15 week menstrual age foetuses and then snap frozen m liquid nitrogen 
before storing in bijou bottles in a -80*C freezer. The two types of 
tissue were used separately, directly from the freezer, to prepare cDNA 
from which restriction fragments were generated for sorting mto 
subsets. ig portions of each of the separate tissues were homogenised, 
using an Ultra-Turrax T25 Disperser (Janke and Kunkel, IKA- 
LabortechniV* , on ice the presence of 4M guanidinium isothiocyanate 
to solubilise macromolecules. RNA was isolated from each homogenate by 
using centrifugation to sediment it through caesium tr if luoroacetate . 
This was performed using the Pharmacia kit according to the 
15 manufacturer's instructions, except that cent r i fugat ion was performed 
for 36 hours and the RNA obtained was finally dc3alted and concentrated 
by performing two ethanol precipitations in succession with two 70% 
ethanol washes after each precipitation. In each case, poiyA" ( mRNA ) 
was isolated from 2C0 to 4C0 pg of the total RNA by binding it to 
magnetic oligo-dT coated beads (Dynal). Solution containing unbound 
material was removed from the beads, which were washed, and then mRNA 
eluted directly for use. mRNA isolation was performed m accordance 
with the manufacturer's instructions. Yields of RNA from the beads 
were between I and 3% of the total RNA. 2 to 4 pg of the eluted rna 
were used for cDN'A synthesis. cDNA synthesis was performed according 
to the method of Gubler, U and Hoffman, (3. J . Gene 25 P 263 (1983) 
using a Pharmacia kit according to the manufacturer's instructions. 
OligodT was used to prime the first strand cDNA synthesis reaction. 
The cDNA was purified by extracting twice with phenol/chloroform and 
then low molecular weight solutes including nucleic acids below C /». 300 
bases were removed by passing the cDNA reaction mixture thrcu-h a 
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instructions. Running buffer for the colunn comprised 10 mJ-1 TrisHCl, 
1 rnM EDTA , 50 mM NaCl 0 pH 7.5. 

The column eluate was adjusted to 10 n\M Mg :+ and then the purified cDNA 
5 was restricted by the action of 1 unit per 10 ul of the endonuclease 
Fok I at 37'C for 1 hour, so that it would b- able to accept adaptors 
to enable fragment sorting. 

The cDNA fragments were purified by two successive phenol /chloroform 
10 extractions followed by passing then through S400 spun columns ae 
described above. 

The adaptors used were oligonucleotides 5* N 4 N 4 N 4 N 4 TCCTTCTCCTGCGACAGACA 
(SFQ TD; 119 4} with the* mmnl pmpnt atu qfr^nrl r» TnTrTr.Trr.rar.rii.r.aar.ra 
15 (SEQ ID: 1195) and 5' AAN,N/TCTCGGACAGTGCTCCGAGAAC (SEQ ID: 1196) or 5* 
TTN 4 N,TCTCGGACAGTGCTCCGAGAAC (S2Q ID: U97) each with the complementary 
5' biotinylated strand GTTCTCGGAGCACTGTCCGAGA { SEQ ID: 1198). These 
were added to 25% of the eluted material by incubating together 200 
pmoles of the mixture of double -stranded adaptors in the eiution buffer 
to which had been added MgCl : to lOmM, ATP to 10mM and 0.025 units/pl 
of T4 DNA ligase. The oligonucleotide 5' biotinylated 

GTTCTCGGAGCACTGTCCGAGA, (SEQ ID: 11S8) and whichever of the 
complementary oligonucleotides with which it was used, each comprised 
1/32 of the molar proportion total adaptors. The final reaction volume 
25 was 90 ul which was heated to 65°C for 3 minutes and then cooled to 
room temperature before the ligase was added. Ligation was performed 
for 16 hours at 12°C. 
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30 



Two successive phenol /chloroform extractions were performed to remove 
the ligase. The final aqueous phase was passed through an S400 spun 
column (Pharmacia) as described above except that the column was used 
with 10 rr-M Tris pH 8.3/53 mM NaCl . 

The column eluate was adjusted to 2 5mM Mg2 + , 0 . 5>r» dNTPs in a final 
35 volume of 2C0 u l . The mixture was placed in a thcrmocycler (Techr.e 
KW2 ) and hected to 7 3 ° C for 5 mir.utec. At this poir.t 10 u^it«= ^ f 
cloned Taq DNA polymerase (AmpliTaq, Perkin Elmer) were added. This 
w - s followed by an incubation at 72^C fcr 10 minutes to fill m the 
unlxgated strand of the adapter. After the second incubation 200 ul of 
40 streptavidm coated magnetic beads (Dyr.al) prepared according to the 
manufacturers instructions were added to bind cDNA ligated to that of 
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TTCTCGGAGCACTGTCCGAl-a {SEQ ID: 1198} biotinylated adaptor. Bead 
binding was allowed to proceed at 28°C for 30 minutes with mixing every 
10 minutes. 

5 Un-biotinylated cDNAs were washed from the beads with 400^1 each of 2M 
NaCl twice, fresh 0.15 mH NaOH four tirr.es at 28-"C for 5 minutes each, 
water twice and finally a buffer comprising 20 mM Tris pH 8.3, 50 mM 
NaCl, and 2SmM M 9 : ". The beads were then resuspended in 240 ^1 of the 
final buffer including additionally 0.5 mM dNTPs and divided into 4x60 
10 nl. 

Four of the 60 pi aliquots, two from each tissue, were processed 
further specifically to prime and copy a subset of the immobilised, 
adaptored f ragrr.er.ts. 2 pmoles of the primer S- CTCTCTGTCGCAGGAGAAGGAA 
(SEQ ID: 1201) were added to each of two aliquots, one from each 
tissue. 2 proles of the primer 5' CTGTCTGTCGCAGGAGAAGGAG (SEQ ID: 
1202) were ac-eJ to each of the other two aliquots. 2.S units of Tag 
DNA polymerase were added to each reaction and 16 cycles of alternate 
denaturation at 95'C for 30 seconds, annealing at 63»c for 2 minutes 
and polymerisation at "2»C for 3 minutes was performed to accumulate 
the selecfed single strands in solution. 



On completion of the DMA svnthesis reactions a further 30 pi of 
resuspended beads were added to each reaction to removo the 

25 biotinylated fragments. The reaction was incubated at 28°C for 30 
minutes mixing every 10 minutes to ensure that the biotinylated strands 
were bead bound. Each aqueous phase containing the newly synthesised 
strands was then removed and extracted with phenol/chloroform twice to 
remove the enzyme before being further purified by passing through an 

30 S400 spun column equilibrated with 10 mH Tris P H 3.3/S0 mM NaCl as 
described above. 

Rounds of PCR amplification of subsets of the selected fragments were 
performed by using the original primer in each case, together with one 
35 of the primers 5' GTTCTCGGAGCACTGTCCG AGAG (SEQ ID: 1199) or 5' 
GTTCTCGGAGCACTGTCCGAG AC SEQ ID: 1200). This Simultaneously rendered 
the fragments double-stranded and increased the amounts of available 
material. It was not known how many cycles of amplification would be 
required at this stage, since each primer pair would be expected to 
40 behave differently. it was therefore necessary directly to determine 
a suitable number empirically by using standard agarose gel 
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of cycles. In sorr.e case^ to avoid the accumulation of non-specific 
products, it Wd. neces y to perform an initial 5 cycles of 
amplification with both o. :he primers present at 2 proles each. All 
reactions were performed using 3 ^ 1 or 12.5 % whichever was the larger 
5 but not exceeding 12 pi of ti>z column effluent abo/e. Reaction 
conditions were adjusted to 20 mM Tris pH 8.3, 50 mM NaCl, 2 5mM Mg : *, 
0.5mM dNTPs and 2.5 units of Taq DNA poj^erase in a final volume of 40 
pi. Apart from when an initial amplification with 2 pmoles of each 
primer was performed, 20 pmoles of each primer were used. Cycles of 
10 amplification were performed at 95°C for 30 seconds, 65°C for 1 minute 
and 72°C for 3 minutes. 



For the purposes of cloning, selected cDNA was amplified as described 
immediately above, except that the reaction was not monitored. 
Instead, the number of cycles which had previously been shown to just 
give rise to all observable products plus another 4 cycles were 
performed. In addition, an extra 72°C for 10 minutes incubation was 
performed after the last cycle. 



20 The products of the reaction were then prepared for directional 
clcni-j. Water was added to adjust the final reaction volume to 60 pi. 
Enzyme w? 3 removed by two successive phenol/chloroform extractions. 
The fin;il aqueous mixture was passed through an S400 column as 
described above, except that it had been equilibrated with 10 mM Tris 

2 5 HC1 pH 7.5, 5 CmM NaCl . 

For directional cloning, advantage was taken of the different known 
sequences introduced at each end cf the selected cDNAs by the adaptors 
in a modification of the method of Aslandis, C. and de Jong, P. J. 

30 (Nucl. Acids Res. 16, p6156 (1990)). Different cohesive ends were 
produced en each end by using the excr.uclease activity of T4 DNA 
polymerase to resect from the 3' end, to the first T in each case. To 
75 pi or 75 % of the column eluate, whichever was least, were added 
9.5 pi cf ICOm" TrisHCl pH7 . 4 , iq 0 rtM KgC12, and 9.5 pi of 0.5 mM dTT? . 

35 16 units cf 7s DNA polymerase were added ar.d the reaction incubated in 
a water batn at 37°C for 30 minutes. The enzyme was removed by 
extracting with phenol / c h 1 oro f crm , twice successively. The salt cf the 
final aqueous phase was adjusted by passing it through an S300 column 
(Pharmacia) equilibrated with 10 m,M TrisHCl pH 7.4, 1 mX EDTA as 

40 described above. 
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and Short J. M., Nucl Acids Res. 17 p9494) was prepared for accepting 
the resected cDNA by restriction cleavage at the Barrel and HindHI 
sites and then adaptoring the resultant cohesive ends using the 
specific adaptors produced by the oligonucleotide 5' AGCTCGGCTCG AGTCTG 
(SEQ ID: 1203) with its partially complementary oligonucleotide 5* 
C CGACAGACAGC AG ACTCG AG CCG (SEQ ID: 1204) and the oligonucleotide 5' 
GATCCGGCTCGAGT (SEQ ID: 1205) with its partially complementary 
oligonucleotide 5' CCG AG AAC ACTCG AG CCG (SEQ ID: 1206). Preparation of 
the vector and adaptoring were performed according to standard 
procedures. Insertion of the cDNA was performed between the BamHI and 
HindHI restriction sites. Recombinant vectors were transformed into 
the host XLl-Blue (Bullock, W. O. et al Biotechniques S p376-378 
(1987)) by the method of Hannahan, D. J. (Mol. Biol. J66 p577-580 
(1983)). Suitable standard controls for the ligations and 

transformations were aiso included. 

Post transforation procedures were as described in "Molecular 
Cloning", 2nd Edition (Sambrook J., Fritsch, E. F. , ana Maniatis, T. 
CSH Press (1989)). Colonies were produced by plating onto X-gal/lPTG 
L-agar plates containing 50j;g/ml ^.picillin and lOpg/ml tetracyclic 
Clear colonies were picked, each into a separate well cf a microtitre 
plate, containing 100^1 of L-broth and 50^g/ml ampicillin. Growth was 
allowed to occur for 16 hours at 37°c. lOOpl of 50% or 30% glycerol 
was added to plates which were archived at -20°C or -80 C C, 
25 respectively. 

Bacteria corresponding to those archived were used for preparing 
templates for sequencing by the dideoxy method (Sanger, F. Kilklen, S. 
and Coulson, A. R. Proc. Natl. Acad. Sci. 74 p5463-5467 (1977)). 

30 Bacteria for this purpose were either grown on L-agar places containing 
50pg/ml of ampicillin, prepared at the s^e time as they had been grown 
in liquid culture, or after plating out from the archive. 
Alternatively, fresh liquid cultures were inoculated from the archive. 
In all cases, cDNA inserts were amplified fcr sequencing by PCR 

35 (Saiki, R. K. et al Science 239 p437-491 (1933)). PCR was either 
performed using bacteria directly added to the reaction, by a 
toothpick, or PCR was performed using l/50th of the plasmid isolated by 
preparative methods (Holmes, D. S. and Quigley, K . Anal. Bicchem. 114 
p!93 (1931)) from the bacteria m the liquid cultures or from the 

40 plates. 

20 prroles of each cf the PCR primers 5' biot ir.ylated GTAAAACGACGGCCAGT 
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(SEQ ID: :::?) and 5' CGA G7CCACGGTATCG (SEQ ID: 1203) were used in 
40^1 reac::.:r.s containing ± . S.mM Kg : ', 50 rnM KC1, Tris-HCl pH 8.3 an 
0.25 units cf Amplitaq (Cetus). Reactions were performed at 95°C, fcr 
i minute, followed by 35 cycles at 95°C for 30 seconds, 60°C for 30 
seconds and 72°C for AO seconds. After the cycles, a final incubation 
at 72 C C fcr 5 minutes was performed. 

After PCR, standard agarose gel electrophoresis was used to determine 
which reactions had been successful. The biotmyiated strands of 
successful reactions were then recovered for single-stranded sequencing 
by binding them to cteptravidin coated beads (Dynal) and then washing, 
all according to the manufacturers instruct i- -.s , except that the 
washing steps were either performed manually or p rfcrmed automatically 
in the 96 well microtitrc plate format using a Bionek robotic work- 
lb station attached to a side-arm loader (3eckman). 

Dideoxy cha..-. termination sequencing reactions were performed using the 

immobilises, biotmyiated strands as templates and 2 pmoles of the 

oligonucleotide 5' CG AGGTCG ACGGTATCG ( SEQ ID: 1209) as primer. 

20 Reactions were periormed using f luorescent 1 y- 1 aoe 1 led terminators (Du 

Pont* or a f 1 u oro£ce in- 1 abel 1 ed primer (Pharmacia) according to the 

manufacturers instructions. Reactions were analysed using automated 

DKA sequencers. A Genesis 2000 was used fcr the "Du Pont" reactions 

and an A.L.F. for the "Pharmacia" reactions. Bases were assigned for 

the Genesis 2000 reads using the manufacturers Rase Caller software. 

Files of called bases were then transferred :o a GUN Network from an 

Apple Macm: computer which had been used for oase calling. Raw 

data from the A.L.F. reads was directly transferred to a SUN networK 

where bases were called using the public domain "trace editor software" 

30 (TED). m both cases, files of called bases were entered into a 

Sybase™ database. Entering data entailed automatically removing vector 

and adaptor or linker sequences, but not editing ambiguous bases. 

After removal cf the unwanted bases, files were automatically compared 

co ocher sequences m the cDNA database and the latest versions cf the 

3 - publicailv available databases g-k^z^y = cvtcc-d-^- c 

-- j ~-~^-''-->, ^ , » ^ . . r. 3. ^ b^iibrr.'j. , Searches ore 

performed with the "basic local alignment search tccl" (BLAST) (Karim, 

S. ana Altscnul, S. F . Natl. Acad. S:;. 57 p226^-226S (1990)). 

Sequences SEQ ID Ncs 1 to 610, given hereinafter, were obtained by the 
4 0 above procedure. 
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A second ir.ethod of preparing cDNA libraries for cbt;ar.:ng gene 
fragments o: the invention took advantage of the PCR normalisation 
process described above. Standard procedures were used .o prepare mRNA 
from RNA that had been isolated by standard caesium chloride bou/ant 
density gradient methods from a full term human placenta. The 
oligonucleotide LNotdt, sequence 5' TACGTTCGACAAGCTTGAATTCGf :GGCCGC (T ) 2£f 
(SEQ ID: 1210) was used at 1 with AMV reverse transcriptase, to 

prime first strand cDNA synthesis under standard conditions from 0.5 ^9 
of the placental mRNA . Temperatures above 65 'C were ured to inactivate 
the reverse transcriptase and then the volume of the reaction made up 
to 100 fjl with water. 
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PCRs were then performed in reactions containing 1 / 1 of the diluted 
cDNA, 10 mH Tris-HCl 8.3, 40 mM KC1, 1.5 mM KgCl : , 0.01% gelatin, 200 
fjK dNTPs , 10 uCi a"'P dCTP, 1 y M each of the primers 11AD1, sequence 5' 
GCC(TA)(GC)CGCCGA (SEQ ID: 1211), and LNotdT and Tag DNA polymerase. 
An initial dentuuration period of 95*c for 90 seconds was followed 
either by 35 cycles of standard PCR , comprising 95 "C for 30 seconds, 
45 "c for 30 seconds and 72 *C for 30 seconds or alternatively 3 cycles 
of the standard PCR already described followed by 27 cycles of Cot 7 CR 
during wMch an additional step of 72'C for 16 minutes was placed 
between all of the 95*0 and 45'C steps of the standard PCR. The 
standard PCP was followed by a single 72 *C for 3 minutes step while the 
Cot PCR was followed by one standard PCR cycle except that the 72 *C 
incubation was performed for 3 minutes. 

Products of the PCR reaction were end repaired by adding 5 units of T4 
DNA polymerase to the reaction and then incurring at 37 'C for 10 
minutes. Enzymes were removed by phenol extraction. The cDNA was 
precipitated by 70% ethanol, dried and then resuspended m Not I buffer. 
20 units of Not I were used to digest the cDNA under standard 
conditions. cDNA was again phenol extracted and ethanol precipitated. * 
10% of the purified NotI cut DNA were ligated to the vector pBluescript 
lt-ng-Meese, M. A. ar.d Short J. M. Nucl. Acid Res. 17 P 9494 which had 
bp?n Prepared as standard to receive this DNA by restricting with the 
enzymes NctI and EccRV. Transformation and processing of clear 
colonies was performed as described above except tnat the host E.col i 
strain DH 5a was used in place of XL-1 Blue. 

Preparation of clones for sequencing, sequencing and sequence analysis 
of cDNAs in clones thu s- produced were performed as cescnoea m hxamole 
1 . 
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Sequences sr; ID Ncs 611 to 772, given hereinafter, were obtained b> 
the above procedure. 
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Example 3 

cDNA libraries corresponding to adult brain cortex (Clontech 
Laboratories, Inc., Cat No. HL10036) and adult bone narro.- (Clontrech 
Laboratories, Inc., Cat No. HL105S6) prepared in lambda gtll phage were 
transfected into g.coli Y1090 and plated out for colour selection of 
recombinant plaques ("Molecular Cloning", 2nd Edition Sc^nbrook J., 
Fritsch, E. F . , and Maniatis, T. CSH Press (1989)). 192 lambda Zap 
clones, corresponding to rhabdomyocarcoma cDNAs and a gift from c. 
Cooper, ICR , Sutton, were similarly plated except that the host XL-1 
Blue was used. 

Clear plaques from each library were resuspended in 5 pi of Tris-HCl pH 
8, 1 m.M E?7A. 2 ul of the r ;ltant phage suspensions were added 
directly to PCRs for the purpos. of amplifying the cDNA inserts for 
sequencing. PCR was performed as described in Example 1, except that 
vhe oligonucleotides used as primers for the lambda gtll clones --ere S 1 
CGTGC-GACCACTCCT^GAGCCCG ( SEQ ID: 1212) and 5' TTGACACCACACCAACTGGTAATG 
(SEQ ID: 1213). Whicheve- of the oligonucleotides was to be used to 
prime the strand which would serve as the sequencing template was used 
in biotinylated form. 

Preparation of clones for sequencing, sequencing and sequence an?.lysis 
of cDNAs in clones thus-produced was performed as described in Lxample 
1, except that 2 pmoles ci the primers that were unbiotinylated in the 
PCR were used as sequencing primers. 

Sequences SZQ ID Nos 773 to 1193, given hereinafter, were obtained by 
the above procedure. 

The foil ow i r i are th° s rn ' ir \T r ^ r ^ r ^ J >:r^i , .v,,,-h -^~~v-,.-~ ^ _ „ , 

, ^ °- * - ~- b * A wnich :c,"pri5G sequences SEQ ID 

Nos 1 to 12 13 referred to hereinbefore. Certain of these secruer.ces are 

preferred, are listed as such after tne main SEQUENCE LISTINGS. 
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CLAIMS: 

A nucleic acid fragment encoding a gene product or portion 
thereof and comprising any one of:- 

(a) a sequence selected from SEQ ID Ncs 1 to 1193; 

(b) an allelic variation of a sequence as defined in (a); or 

(c) a sequence complementary to (a) or (b) . 

2. A nucleic acid sequence as set out in any one of SEQ ID Nos 1 to 
1193, or a complement or allelic variation thereof. 

-3 3. A sequence as claimed in claim 2 and which exhibits no more than 
90% homology to a human sequence known per se. 

4. A nucleic acid fragment comprising a portion of a sequence as 
defined in claim 2 or claim 3 of sufficient size such that a probe of 
the same size and exhibiting complementarity to said portion can 
hybridize to said sequence as defined in claim 2 or claim 3. 
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5. A fragment as claimed in claim 4, wherein said portion is at 
least 15 bases in length. 

6. A fragment «-*s claimed in any one of claims 1 , 4 or 5 and encoding 
at least a portion of a biologically active polypeptide. 

7. A nucleic acid sequence as claimed in claim 2 or claim 3 and 
30 encoding at least a portion of a biologically active polypeptide. 

3. A DNA construct comprising a fragment as defined in any one of 
claims 1, 4, 5 cr 6 or a sequence as defined in any one of claims 2, 3 
or 7, together with a control cr regulatory sequence. 

35 



9. A construct as claimed in claim 3 which encodes a fusion protein 
comprising a known protein and the polypeptide encoded by said fragment 
or sequence. 

A0 I0 - A construct as claimed in claim 9, wherein the fusion protein 
encoded ir, * cleavable f^im rr^oi« v_,._^ . _^ ; rl 
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protein and said fragment or sequence. 

11. The use of a fragment as defined in any one of claims 1, 4, 5 or 
6 or a sequence as defined in any one of claims 2, 3 or 7 to produce a 
gene . 

12. A DNA fragment comprising a gene obtainable by the use defined in 
claim 11. 

13. An expression vector comprising a fragment as defined in any one 
of claims 1, 3, 5 or 6, a sequence as defined in any one of claims 2, 
3 or 7, a DNA construct as defined in any one of claims 8 to 10, or a 
DNA fragment as claimed in claim 12, positioned such that that nucleic 
acid sequence which encodes the polypeptide corresponding to said 
fragment, sequence or DNA fragment is in operable reading frame with a 
control or regulatory sequence. 

14. A vector as claimec i claim 13, wherein said vector control or 
regulatory sequence comprises a regulatable promoter. 

15. Host cells which incorporate as a heterologous part of their 
expressible genetic information a fragment as defined in any one of 
claims 1, 3, 5 or 6, a sequence as defined in any one of claims 2, 3 or 
7, or a DNA fragment as defined in claim 12. 

16. A process for the production of a polypeptide comprisinn 
cultivating host cells as defined in claim 15. 

17. An antibody directed against a polypeptide obtainable by the 
performance of a process as defined i n claim 16. 

18. An antibody as clM^d in cla im .7 and which is monoclonal. 



19. 



A novel gene product, or portion thereof encoded by a fragment c-.s 
wi v.loirr.5 1 , 3 , 5 or 0, or encoded by a sequence as 
defined in any cne of claims 2, 3 or 7, or encoded by the gene 
comprised in a DNA fraonent as defined in claim 12. 
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